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g ■ ABSTRACT 

o . 

. If gamma-ray bursts originate in galaxies at cosmological distances, the host 

^ ! galaxy should be detected if a burst error box is searched deep enough; are the 

P_|! host galaxies present? We present and implement a statistical methodology 

which evaluates whether the observed galaxy detections in a burst's error box 
are consistent with the presence of the host galaxy, or whether all the detections 



O 



c§ ! can be attributed to unrelated background galaxies. This methodology requires 

the model-dependent distribution of host galaxy fluxes. While our methodology 
^ , was derived for galaxies in burst error boxes, it can be applied to other 

^ \ candidate host objects (e.g., active galaxies) and to other types of error boxes. 

As examples, we apply this methodology to two published studies of burst error 
boxes. We find that the nine error boxes observed by Larson & McLean (1997) 
are too large to discriminate between the presence or absence of host galaxies, 
while the absence of bright galaxies in the four significantly smaller error boxes 
observed by HST (Schaefer et al. 1997) does confirm that there is a "no-host 
galaxy" problem within the "minimal" host galaxy model. 



Subject headings: gamma-rays: bursts — methods: statistical 
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1. INTRODUCTION 

If gamma-ray bursts originate at cosmological distances then they most Ukely occur 
in (or near) galaxies; are the host galaxies present in burst error boxes? There have been 
various claims as to whether the error boxes which have been searched in the optical band 
contain galaxies bright enough to be the expected hosts of the burst sources. The question 
is whether the observations — the galaxies observed above the detection threshold — are 
consistent with the presence of the expected host galaxy when unrelated "background" 
galaxies are also be present. Here we present and implement a methodology to evaluate 
this question. 

The "no- host galaxy" problem was first raised by Schaefer (1992) who presented a 
compendium of brightness upper limits for galaxies in the error boxes of 8 classical bursts, 

the soft gamma-repeater associated with the 1979 March 5 event and 4 optical transients. 
For each error box Schaefer calculated the distance to a galaxy with the luminosity of M31 if 
the galaxy were as bright as the detection threshold for that error box. The distances were 
typically a few Gpc (a Gpc corresponds to z = 0.25 for Hq = 75 km s~^ Mpc~^), requiring 
isotropic radiated energies greater than 10^^ erg in some cases. Note that Schaefer did not 
claim that there are no galaxies in the error boxes, just that there are no bright galaxies. 
Indeed, he used the brightest object in his field as the upper Hmit for the brightest galaxy. 

Subsequently Fenimore et al. (1993) analyzed the PVO-BATSE cumulative burst 
intensity distribution under the assumption that bursts are cosmological, and assigned 
distances to the 8 classical bursts in Schaefer's sample based on the bursts' intensities. 
Using the distance estimate and Schaefer's brightness hmit for each error box, Fenimore et 
al. calculated first the upper limit to the host galaxy's luminosity and then the fraction 
of the host galaxy himinosity function which is fainter than this limit. They derived the 
host galaxy luminosity function as the normal galaxy luminosity function weighted by 
the luminosity since the number of potential burst sources in a galaxy presumably scales 
with the number of stars in the galaxy, and therefore with the luminosity (for a constant 
mass-to-light ratio). If only host galaxies are present, the fraction of the host galaxy 
luminosity function less than the observed host galaxy's luminosity should be distributed 
uniformly between and 1, with an average of 1/2. Indeed, the analysis of Fenimore et 
al. gives an average value of 0.44±0.10 for the upper hmits calculated from Schaefer's 
compendium. However, the average value of this fraction for the host galaxies' actual 
luminosities is undoubtedly smaller (at least some of the host galaxies must be fainter than 
the upper limits), and Fenimore et al. concluded that the observations were only marginally 
consistent with bursts occurring in galaxies. 

Vrba, Hartmann, & Jennings (1995) monitored the error boxes of 7 classical bursts 
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and one optical transient for 5 years; many of the burst error boxes were included in the 
compendium of Schaefer (1992). Vrba et al. searched for, but did not find, unusual objects 
which varied or had bizarre colors. Many galaxies were identified based on morphology 
(for V < 21.6) and color (for fainter objects) — the number of galaxies was consistent with 
galaxy counts — but whether the error boxes contained a galaxy bright enough to be the 
expected host galaxy was not considered. 

Larson & McLean (1997) observed in the infrared nine of the smallest error boxes of 
classical bursts localized by the third Interplanetary Network (Larson, McLean & Becklin 
1996 presented a preliminary report on six error boxes); these error boxes are typically ~ 8 
arcmin^. In or near all but one error box they found at least one bright galaxy (K< 15.5). 
The fraction of the host galaxy luminosity function fainter than the brightest galaxy in each 
error box, the statistic introduced by Fenimore et al. (1993), has an average of 0.47±0.10, 
consistent with the value 0.5 which is expected if only host galaxies are present (Larson 
1997). Larson & McLean (1997) recognize that the error boxes are too large to discern 
between the host galaxy and unrelated background galaxies. However, they report that the 
surface density of bright galaxies is approximately a factor of two larger than the average, 
which they interpret as possible evidence for clustering around the host galaxy. 

Schaefer et al. (1997) searched the error boxes of 5 classical bursts with the Hubble 
Space Telescope (HST); 4 of the error boxes are small (of order ~ 1 arcmin^) and 3 are in 
the compendium of Schaefer (1992). This study also looked for, but did not find, unusual 
objects with ultraviolet excesses, variability, parallax or proper motion. Galaxies are 
present, but faint. Following Fenimore et al. (1993), Schaefer et al. estimated the distance 
to the host galaxies using the bursts' peak flux, with the distance scale determined from 
the burst cumulative intensity distribution. The brightest object in each field which could 
be a galaxy sets the upper limit on the host galaxy's brightness. An upper limit on the host 
galaxy's luminosity is derived from this observed brightness upper limit and the estimated 
distance; for the 4 small error boxes the luminosity upper limits are 10-100 times smaller 
than the luminosity of an L* galaxy. 

Thus the issue is not whether there are galaxies in burst error boxes, but whether the 
observed galaxies are likely host galaxy candidates. Galaxies may be present, but they may 
be fainter than the expected host galaxy brightness. Alternatively, the error box may be so 
large that the host galaxy is only one of the many expected unrelated background galaxies. 
The statistical analyses of the observations have thus far treated the brightest object within 
the error box as the host galaxy's brightness upper limit, even though many galaxies were 
detected. These analyses have not considered the size of the error box or the effect of the 
expected background galaxies. In addition, most analyses of error boxes treat the region 
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within a certain confidence contour, typically 99%, as equally likely, whereas in reality the 
location probability density peaks within the error box. 

Therefore, to evaluate whether the host galaxy is present we have developed and 
implemented a methodology which considers the location probability density (and thus the 
size of the error box), all the detected galaxies, and the presence of background galaxies. 
While this methodology was derived to determine whether host galaxies are present in 
burst error boxes, it can be applied to other candidate host objects, such as active galaxies 
(Luginbuhl ct al. 1996). Of course, the methodology can also be used in other astrophysical 
contexts where a counterpart in one wavelength band is sought for a source observed in 
a second band. Although the methodology is derived within a Bayesian framework, the 
resulting Bayesian "odds ratio" can be understood intuitively, and thus can be treated as 
a non-Bayesian statistic. The ultimate purpose of the odds ratio is to determine whether 
host galaxies are present, but it can also be used to determine which error boxes will 
have the power to answer this question. We use the standard notation where p{a \ b) is 
the conditional probability of proposition a given proposition b. Propositions are simple 
statements, the validity of which may or may not be in question. 

This methodology deals with error boxes which have a finite size and assumes that 
bursts occur within visible host galaxies. If the optical transients in the Beppo-SAX error 
boxes do indicate the locations of the GRB 970228 and GRB 970508 bursts, then these 
bursts (and subsequent similarly localized bursts) can be treated as having much smaller 
error boxes than previous bursts. Our methodology can be applied to these bursts by using 
as the error box the region within which a galaxy would be acceptable as a host. The 
GRB 970228 optical transient appears to sit on a shghtly extended source which may or 
may not be a galaxy (Djorgovski et al. 1997 report that an R=25.5 source is still present 
half a year after the burst). No extended source has been associated with the GRB 970508 
optical transient. This has revived suggestions that the burst source is expelled from the 
host galaxy (e.g., Lipunov et al. 1995; Bloom, Tanvir, & Wijers 1997). Source ejection 
can be treated within our methodology by expanding a burst's error box by the (model 
dependent) angular distance the burster would have traveled before bursting. 

Any analysis of the host galaxy issue depends on the expected host galaxy distribution. 
The examples presented here, the analyses published elsewhere and indeed most studies 
of the possible cosmological properties of burst ensembles use a "minimal" cosmological 
model. In this model bursts are assumed to be standard candles which do not evolve in 
comoving density or luminosity. Modeling the intensity distribution gives a unique mapping 
between a burst's intensity and its distance. Bursts occur in galaxies at a rate proportional 
to a galaxy's mass and (assuming a constant mass-to-light ratio for all galaxies) therefore 
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luminosity. This is most likely overly simplistic and in the future we will relax various 
assumptions of this minimal model. However here, to demonstrate our methodology, we 
will test the minimal cosmological model. 

In §2.1 we develop the methodology, which is dependent on the model host galaxy 
distribution, as discussed in §2.2. The abihty of the observations of a given error box to 

discriminate between the presence or absence of the host galaxy can be evaluated using this 
methodology (§2.3). Using the minimal cosmological model, we apply this methodology in 
§3 to the K-band observations of Larson & McLean (1997) and the HST observations of 
Schaefer et al. (1997). Finally in §4 we discuss important issues raised by our methodology. 



2. METHODOLOGY 

2.1. Likelihood Ratio 

The basic strategy for evaluating whether the host galaxy may be present uses the 
three-dimensional space consisting of the two sky coordinates and the optical flux / 
at each position. The detection threshold /um may vary over the searched region; for 
example, a bright star reduces the detectability of faint galaxies in the star's immediate 
vicinity. The searched region may not include the entire burst error box and may extend 
beyond it. Assume that Ud galaxies are detected, each with a flux fi located at An 
observed galaxy is either the host galaxy or an unrelated background galaxy. The searched 
portion of the flux-position space should be visualized as little bins, most of which arc 
empty; expressions are calculated for flnitc-sizcd bins which arc subsequently reduced to 
inflnitesimal dimensions. The probability for obtaining the observations is the product of 
the probabilities for the observed detection or nondetection in each bin. 

Let the distribution of background galaxies be dN = 0(/) dfdQ where we assume that 
these galaxies are distributed uniformly without any clustering. If it exists, the burster host 
galaxy is drawn from ^(/) which must be normalized to 1 since there can only be one host 
galaxy per error box. The burst localization results in a probability density p{fl) for the 
burst's position on the sky; p is also normalized to 1. Therefore, the probability that the 
host galaxy is at position Q with flux / is p(/, Q) = ^{f)p{Q). 

The three dimensional flux-position space is broken into bins with dimensions AQ 
and A/. The probability of flnding a background galaxy in a bin is governed by Poisson 
statistics; the expected number of galaxies in a bin centered on fj is rij — (f){fj)AfAQ. The 
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detection probabilities are 

p^.(n = 0)= e-"^ =e-W.)A/An 

p.^n = l)= rijC-''^ = 0(/j)A/Af]e-^(^^)^^^" . (1) 

The bin volumes A/ Ail are assumed to be small enough that no more than one background 
galaxy per bin need be considered, particularly since eventually A/AQ — > 0. The 
probability that the host galaxy is found in the jth bin is 

?,- = *(/,)p(Q,)A/AQ . (2) 

Note that the occurrence of the host galaxy is not governed by Poisson statistics since (by 
assumption) there is only one host galaxy. 

Now let ifhg be the hypothesis that the error box contains the burster's host galaxy in 
addition to background galaxies, while H^g is the hypothesis that only background galaxies 
are present. First we calculate the probability p{D \ i^bg) of obtaining the observed rid 
galaxies and not observing galaxies at all other values of / and Q (the data proposition D) 
assuming hypothesis ifbg- This probability p{D \ H\^g), the likelihood for i^bg) is the product 
of the probabilities of the observations in each bin. Thus 



^ e-San.>(/.OA/An^n,^^^^)^^^^ = ^ ^^o) '^^ ^^^^ ff </,(/,) A/AQ 



where in the argument of the exponential we have let A/ and Afi go to zero so that the 
sum becomes an integral. Note that 



{na)= dn df<P{f) (4) 

is the number of background galaxies expected to be observed within the searched region. 
The probability p{D \ H^g), the likelihood for H^g, is 



p{D\H^g)=p{D\H,,g) 



JdnJ^ df^{f)p{n) + Y: 



^^"■"^"^.,T,..^ , ^^ (/i)p(^i)A/AQ 



(5) 



where the first term in the brackets is the probabihty that the host galaxy was unobservable, 
and the terms in the sum are the probabilities that the ith observed galaxy is the host 
galaxy and not a background galaxy (the denominator cancels the factor in p{D \ i?bg) that 
described the host galaxy as a background galaxy). If there are galaxies in the error box. 
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then we have to consider the possibihty that any one of them (or none of them) might be 
the host galaxy; thus p{D \ H^g) is a sum of the probabihties for each possible occurrence 
of the host galaxy. If part of the error box is not observed, as often occurs in searches for 
contemporaneous counterparts (e.g., by GROCSE — Lee et al. 1997; Park et al. 1997a — or 
LOTIS — Park et al. 1997b), then /um is effectively infinite for the unobserved portion; the 
integral over Q in the first term should be over the entire error box. 

In standard "frequentist" statistics the ratio p{D \ Hhg)/p{D \ i/bg) can be used as a 
measure of how well the presence of a host galaxy explains the data. In Bayesian statistics 
the odds ratio 

updates the ratio of the "priors" p(i?hg)/p(-f^bg), the probabilities that i?hg and i?bg are 
true based on information available before obtaining the new data D, using the "Bayes 
factor" p{D \ H\^g)/p{D \ H^g), the ratio of the likelihoods. The values of the priors p(i7hg) 
and p{H\^g) depend on our assessment of the validity of the hypotheses, and in the absence 
of a strong preference for one hypothesis over the other, it is best to set p(iJhg)/p(/Jbg) = 1- 
A value of the odds ratio much larger than one favors the presence of a host galaxy, while a 
value much less than one indicates that no host galaxy is present; the observations cannot 
discriminate between the two hypotheses for a value of order unity. Clearly the search for 
host galaxies in multiple error boxes can be treated by the product of the likelihood ratios 
for each error box. In our case the ratio of the likelihoods for one error box is 



P{D I i/bg) 



=/.o/;-^^'^./^(/)p(o)+E™^ ■ (7) 



The host galaxy is more likely to be present but unobserved when a large fraction of the 
host galaxy flux distribution is below the detection limit (the first term in this expression). 
A detected galaxy is more likely to be the host galaxy than a background galaxy if there 
is a higher probability for the host galaxy to be present at that location and flux than a 
background galaxy. Note that a large value of p(Jli) indicates a small error box. Indeed, if 
the localization probability density is set to a constant within a region (e.g., within the 99% 
contour) , as is usually done, then p would have a value inversely proportional to the area of 
this region. However, by using p{Q) we allow the use of more information about the burst 
localization. 



-8- 



2.2. Distribution of Host GalcLxies 

The galaxy distributions required to decide whether a host galaxy is present are based 
on the distribution of galaxies as a function of flux and redshift, $(/, 2;). This distribution 
is observed directly, although cosmologists are ultimately interested in the distribution as a 
function of luminosity and not flux. The background galaxy flux distribution used here is 

= Jdz^f,z) . (8) 

Although currently known imperfectly, $(/, z) can be established empirically by redshift 
surveys; (f){f) is more easily determined directly from galaxy counts. 

We assume the host galaxy is not drawn from the same observed flux distribution as 
the background galaxies. While the background galaxies' flux distribution is observed (e.g., 
by galaxy count surveys), the host galaxy's flux distribution must be modeled. Here we 
develop the "minimal" cosmological model distribution; as discussed below (§4), reasonable 
variants have been proposed which may lead to different conclusions. First, it is likely that 
a galaxy's burst rate is proportional to the mass of the galaxy (Fenimore et al. 1993). 
Most cosmological theories attribute bursts to the mergers of two compact objects (e.g., 
neutron stars) within a binary system (Eichler et al. 1989; Narayan, Paczyhski, & Piran 
1994); the number of such objects is presumably proportional to the number of stars, and 
thus the mass. Assuming all galaxies are characterized by the same mass-to-light ratio, the 
background galaxy distribution should be weighted by the luminosity (or flux) to derive the 
host galaxy distribution. Second, the host galaxy may be modeled to fall within a redshift 
range C{z). Combining these two modeling factors gives 

io^dfj,^dzaz)fmz) ■ ^ ' 

The burst distance scale is unknown but has been modeled using intensity distributions 
(e.g., Fenimore et al. 1993). An intensity quantity G intrinsic to the burst (e.g., peak 
photon luminosity or total energy emitted) is considered to be a fundamental burst property 
(perhaps a constant); however, the normalization of G is unknown. Cosmology modifies the 
Euclidean rf"^ relationship between G and the related observed intensity quantity g (e.g., 
peak photon flux or energy fluence). Thus, the normalization of G and the distance scale 
are derived from the distribution of g (as are other modeling parameters). The resulting 
relationship can be inverted to calculate a range of likely distances for a given observed 
value of g. While most models assume that G is a standard candle and that the source 
density is constant per comoving volume, in the more general case the density may evolve, 
G may be characterized by a luminosity function, and a K-correction may be necessary to 
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relate g and G (the energy band of the observation is redshifted) . Ultimately this modeling 
should produce the distribution of sources dN/dz as a function of redshift and g. Finally, 



dN 
dz 



5W = j^ . (10) 

In most simple model is used. Thus ^{z) is assumed to be a delta function, 

providing a direct mapping between the peak photon flux or the energy fluence and the 
redshift. Given the uncertainties of modeling the host galaxy distribution, the shape of the 
galaxy luminosity function at a given redshift $(/, z) can be approximated by a Schechter 
function (Peebles 1993, p. 120), 

^(l/) = ^ol/"e-^ , (11) 

where y = L/ = f / f*. The intensity scale, or is typically measured as the 
absolute magnitude in a given spectral band. Note that a K-correction should be applied 
for redshifts more than a few tenths (the flux observed in a given energy band was 
emitted by the galaxy in a different band); in addition, the luminosity function underwent 
evolution both in scale (i.e., the value of L*) and normalization. Recent surveys flnd: 
for the bj band M^. = -19.72 ± 0.09 and a = -1.14 ± 0.08 (Ratcliffc ct al. 1997); for 
the K-band = —23.12 and a = —0.91 (Gardner et al. 1997); and for the R-band 
Mr = -20.29 ± 0.02 and a = -0.70 ± 0.05 (Lin et al. 1996). To all these expressions for 

must be added an additional term blogh, where h = ifo/(100 km s~^ Mpc~^), resulting 
from the uncertainty in Hubble's Constant Hq; however, this dependence on the value of Hq 
is cancelled by the Hq dependence in the relationship between the flux and z (M* is derived 
from observations of magnitude vs. redshift). When we do need i?o, we use h — 0.75. Since 
a is of order -1, we approximate "^if) as a simple exponential: 

*(/) = exp[-///,]//, . (12) 

Using the approximation for small values of z, 

= + 51og[3 X 10^^] and f^z) = /olO-°-^^*[3 x 10^;^]"^ , (13) 

where /o is the normalizing flux (i.e., the flux of a magnitude object) for a given band, 
and m* is the apparent magnitude corresponding to /*. 



2.3. Sensitivity 



We can evaluate how well our methodology discriminates between the presence or 
absence of a host galaxy in a given error box. Since (f){f) and ^(/)p(Q) are the probabilities 



of the presence of a background galaxy and the host galaxy, respectively, in a given patch 
of sky Q at a flux /, the expected value of the Bayes factor is 




m 



(14) 



if a host galaxy is present. If there is no host galaxy then both the second term in the 
brackets (i.e., ^{f)p{Q)) in the first equation of eq. (14) and the integral in the second 
equation should not be included. Thus on average {p{D \ H\^^/p{D \ i^bg)) = 1 without 
a host galaxy, which may seem surprising, but results from background galaxies being 
occasionally mistaken for the host galaxy. Under the hypothesis iJbg that there are no 
host galaxies, p{D \ H\^^/p{D \ i^bg) "will be less than 1 in most error boxes. However, in 
those cases where a background galaxy with the flux expected for the host galaxy falls in 
the error box, p{D \ Hhg)/p{D \ ifbg) will be greater than 1. The more unhkely such an 
occurrence, the smaller the value of and therefore the larger the value of the second term 
of p{D I Hy^^/p{D I ifbg) when a background galaxy is mistaken for the host galaxy. 

The methodology's power to determine that a host galaxy is present in an error box 
for a given observation depends on the value of / dO. //^^(q) df '^{fY p{VlY / (j){f), the term 
added by the presence of a host galaxy. To evaluate this expression we assume that a 
constant probability error box (i.e., p — I/JIq over an area Qq) is searched to a uniform 
detection threshold /um. We approximate the cumulative galaxy count distribution over 
the error box as A^(> /) = (///b)"^^^ (the distribution expected for a constant galaxy 
density in three-dimensional Euclidean space), where fh is the flux at which one background 
galaxy is expected in the error box. Since A^(> /) oc VLq, fh will vary from error box to 
error box as h oc From N{> f) we derive 0(/) = (3/2/(.fio) (///&) As discussed 

in §2.2, we use a weighted Schechter function for the host galaxy distribution function, 
*(/) — exp [—///*]//*. Consequently if a host galaxy is present 



where V{a,x) is the incomplete gamma function. Since most of the area under the curve 



^5/2g u jg above u — 1, the value of F (7/2, 2/iiin//*), and therefore of {p{D \ i^hg) /p{D \ i^bg)) 





3/2 



(15) 
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ls relatively insensitive to /um as long as fum/ f* < 1/2, that is, when the detection threshold 
is less than the expected host galaxy flux. On the other hand, {p{D \ Hhg)/p{D \ i^bg)) is 
very sensitive to the value of ft,, the flux at which we expect one background galaxy in the 
error box. Since 1(7/2) = 3.32335, {p{D \ Htg)/p{D \ H^^)) ~ 1 + 0.1958{f ^/ fbf/'^ . Our 
methodology (and most likely any methodology) can discriminate between the presence and 
absence of a host galaxy in given error box when, on average, the host galaxy is expected 
to be much brighter than the brightest background galaxy. 

These results were derived analytically by integrating over the number of galaxies 
detected in the error box. This integration treats the number of galaxies as a continuous 
quantity. We have verified the results with simulations with discrete bursts made under 
the same assumptions that led to eq. (15). Define a = /um//* and P = fb/ f*- Then our 
analytic formalism gives 

K) = exp(-a) , (ub) = , {Ob) = 1 , 

(O,) = 1 + 0-1958/3-^^'^i^|§^ ' (16) 

where Uh and rib are the numbers of host galaxies and background galaxies, respectively, 
detected in an error box. (Oh) {{Ob)) is the odds ratio if host and background galaxies 
(only background galaxies) are present, assuming the prior ratio is 1. In each trial of a 
given simulation we generated one host galaxy with normalized fiux eh = fh/ f* and 100 
background galaxies with normalized fiuxes = fi/ f*, where all the e values were drawn 
from the appropriate distribution functions. Then 

100 5/2 5/2 

{Ob)^l-e-'^ + j:^j^J{e,-a) and (O,) = (O,) + |^^(e, - a) , (17) 

where 9{x) is the Heaviside function which is used here to enforce the requirement that a 
galaxy be detectable to be included. The results of our simulations are provided by Table 1. 
As can be seen, the agreement with the analytic formulae in eq. (16) is very good. The 
largest deviations occur when {ub) is very small, and even 10^ trials do not provide sufficient 
statistics. 



3. APPLICATION 



In this section we apply our methodology to observations of gamma-ray burst error 
boxes reported in the literature. Not all details are provided in these publications, 
and we make a number of simplifying assumptions; for example, a constant probability 
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density over the error box is used. We write the ratio of hkehhoods (eq. [7]) as 
p{D I H^s)/p{D I //bg) = gi + EUi Q2i where 



no 

and 



q^^ jdn df = 1 - ^e-^"-/^* (18) 



_ ^(/,)p(Q,) _ e-/^//- 

In these expressions is the size of the error box (p oc I/JIq) a-nd Jl^ < is the size of the 
searched region. Since we are testing the "minimal" cosmological model, for the host galaxy 
distribution function ^(/) we use the weighted Schechter function in eq. (12). We derived 
the distribution of background galaxies </>(/) from the review paper by Koo & Kron (1992). 
In their Figure 1 they compile galaxy counts in the K, R and bj bands from a number of 
different sources. We parameterized the differential galaxy count distribution by choosing 
values which fell within the cluster of observed data points. 



3.1. K-Band Observations of Larson &: McLean (1997) 

Larson & McLean (1997) observed the error boxes of 9 recent bursts localized by the 
third Interplanetary Network as well as a number of control fields, primarily in the K band. 
Many of the error boxes were only partially observed because of corrections to the error 
boxes after the observations. They found many relatively bright galaxies both within and 
immediately outside of the error boxes. The error boxes are typically ~ 8 arcmin^, and 
background galaxies are expected; Larson & McLean recognize that they cannot distinguish 
between host and background galaxies in their data. Based on their observations of control 
fields and galaxy counts from the literature, they report that the overall galaxy density in 
and near the error boxes is about a factor of 2 greater larger than average, which might 
result from clustering around the host galaxy. 

Although Larson & McLean recognize that their observations are consistent with, but 
do not prove, the existence of host galaxies, their earlier work (Larson, McLean & Becklin 
1996) was interpreted as showing there was not a host galaxy issue. Therefore we analyzed 
their observations to determine what statement could be made about the existence of host 
galaxies. In Table 2 we list for each of the nine error boxes studied by Larson & McLean the 
size of the box, the fraction of the box covered by the observations, and the K magnitude 
of the brightest galaxy in the box. We assume the probability density p is constant within 
the error boxes. Larson (1997) provides an estimated redshift for these bursts. Ideally, we 
would use a list of magnitudes for all detected galaxies above the limiting magnitude (here 
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K ~ 18.5), but here we only have the brightest galaxy. Therefore, we use the magnitude 
of the brightest galaxy as both the limiting magnitude and the magnitude of the single 
detected galaxy for each box. 

The flux at which we expect to find background galaxies is /& = 5.8 x IO^^^Qq''^ ~ 
2.32 X 10~^^ erg s~^ cm~^ Hz~^ for Qq ~ 8 arcmin^. Therefore and the flux of the 

detection threshold (which is also the flux of the brightest galaxy) are all comparable. In 
Table 2 we compute the various terms of the likelihood ratio. The first term, gi, which is 
also the probability that the host galaxy is fainter than the detection threshold, increases 
when the box is not observed completely. The product of the likelihood ratios gtot foi' each 
error box is the likelihood ratio for the ensemble; here the product is 0.248, which indicates 
that we cannot determine whether the expected host galaxies are present or absent. 



3.2. HST Observations of Schaefer et al. (1997) 

Schaefer et al. (1997) observed four small error boxes with the WFPC2 on HST in both 
the B and UV bands (they also studied a much larger error box with the FOC; this error 
box is not considered here). Here we apply our methodology to their B-band observations; 
in our analysis we use bj distribution functions. In each case a detection threshold is given. 
Sources are found in three of the four error boxes, but we consider only the small number 
of sources which are identified as galaxies. 

Two of the error boxes were only partially observed. Schaefer et al. stated that 
ground-based observations of the GRB 790613 error box show there are no sources HST 
would have detected in the 10% of the error box which was not observed. Similarly, Schaefer 
(1997, private communication) reports there are no galaxies to the i^S'T detection limit in 
the ~ 15% of the error box unobserved by HST. Thus Qg — (see eq. [18]) for these two 
boxes. 

Note that Larson (1997) and Schaefer et al. (1997) give z values for GRB 920406 which 
differ by a factor of two. Larson uses redshifts provided by E. Fenimore, while Schaefer 
et al. give distances calculated from peak energy fiuxes assuming a peak luminosity of 
6 X 10^° ergs s~^. This peak luminosity was calculated for Hq — 75 km s~^ Mpc~^, which 
we used to convert the distances to redshifts. 

Table 3 presents the results of applying our methodology to the HST observations. 
As Schaefer et al. concluded, the detection thresholds are sufficiently fainter than the 
expected f^, such that the host galaxy should have been observed. In addition the observed 
galaxies are as faint as the expected background galaxies for such small error boxes; for 
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a threshold of B—2S we expect ~ 3 galaxies per arcmin^. The product of the likehhood 
ratios is 2 x 10~^, which indicates that these observations strongly favor the hypothesis 
that only background galaxies and no host galaxies are found in the error boxes observed 
by HST. Note that fb = 7.7 x IQ-^^fll^^ erg s'^ cm'^ Hz"! and thus /* > /;,; by §2.3 our 
methodology should be able to determine that host galaxies are present in these error boxes. 

4. DISCUSSION 

Almost all the quantities required for analyzing an error box are obscrvablcs, although 
they may be difficult to derive and may be characterized by large uncertainties. However, 
the host galaxy distribution function is model dependent, and any conclusions based on 
applying our methodology is a statement about the validity of the host galaxy model. Here 
we assume that bursts occur within galaxies, and that the number of sources is proportional 
to the luminosity of the galaxy. This model is consistent with the scenario where bursts 
result from the merger of neutron star binaries. In analyzing the observations from the 
literature we used the "minimal" cosmological model, although in §2.2 we outline how a 
distribution for the source redshift can be derived. 

As discussed above, this methodology can also be applied to scenarios where the burst 
source is ejected from the host galaxy by expanding the error box by the angular distance 
the source may have traveled before bursting. For arcmin^ scale error boxes, broadening 
the error box may be inconsequential, but for the well-localized bursts resulting from the 
newly-discovered optical transients, the size of the error box may be determined primarily 
by the distance the source traveled before bursting. 

While the minimal cosmological host galaxy model we use is consistent with current 
modeling assumptions in studies of the burst database, reasonable variants can result 
in significantly different conclusions, and the analysis of an error box may have to be 
revisited as the source models evolve. For example, the observed burst distribution may be 
characterized by a broad luminosity function (Horack ct al. 1994, 1996 and Hakkila ct al. 
1995, 1996 found that the luminosity function must be narrow, but Loredo and Wassernian 
1997a,b and Brainerd 1997 dispute this conclusion), allowing a given burst to originate over 
a broad redshift band. The source density may be greater in galaxies which have undergone 
starbursts, which may favor small galaxies at moderate redshifts (Sahu et al. 1997). 

Our methodology compares two hypotheses: 1) a host galaxy is present in addition to 
background galaxies; or 2) only background galaxies are present. This analysis is applied 
to each error box independently; the product of the likelihood ratios compares these 
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hypotheses for the data set as a whole. This methodology does not test directly whether the 
candidate host galaxies are indeed distributed according to the model distribution function. 
Fenimore et al. (1993) introduced a test which considers the distribution of the statistic 
S = Jq' df^![f), where fi is the flux of the brightest galaxy in, or the detection threshold 
for, the iih. error box. If the fi indeed correspond to the host galaxies, then the statistic S 
should be distributed uniformly between and 1, and should have an average value of 1/2. 
However, this test does not evaluate whether /j corresponds to the host galaxy. 

5. SUMMARY 

We have developed a methodology to evaluate whether the detections and nondetections 
of galaxies in gamma-ray burst error boxes are consistent with the presence of the burst 
source's host galaxy, or whether all the detections can be attributed to unrelated background 
galaxies. This methodology relies on the distribution of background galaxies, which is 
observed, and the flux distribution for the host galaxy, which must be modeled. Any 
conclusions are dependent on the host galaxy model. In addition to evaluating the 
observations of a particular error box, our methodology also predicts its likely sensitivity 
for a given error box. Of course, the methodology can be used for other candidate host 
sources, or for similar astrophysical problems. 

The methodology allows the maximal use of observational data. Thus a variable 
probability density for the burst's location can be used instead of assuming that this 
probability is constant across the error box. The detection threshold is allowed to vary 
across the error box; this permits the treatment of partially observed error boxes. 

As examples we applied this methodology to the K-band observations of Larson & 
McLean (1997) and the if5T observations of Schaefer et al. (1997). Larson & McLean 
found a large number of galaxies in or near 9 error boxes, but our analysis shows that these 
detections can easily be explained as background galaxies. We find that the error boxes 
Larson et al. observed were too large to discriminate between the presence or absence of 
a host galaxy. Schaefer et al. (1997) found only faint galaxies; our analysis shows that 
brighter host galaxies should have been observed, but were not. Our sensitivity analysis 
verifies that these error boxes are small enough to determine that a host galaxy is indeed 
present. Therefore, within the assumptions of the "minimal" cosmological model used in 
our analysis (e.g., standard candle bursts; no evolution; the number of potential sources is 
proportional to the size of the galaxy), the expected host galaxies are absent. 

In the future we will apply this methodology to other observations of small error boxes. 
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and explore other host models. 

We thank F. Vrba and C. Luginbuhl for stimulating discussions. We appreciate the 
comments and encouragement of the referee, B. Schaefer. D. Band's gamma-ray burst 
research is supported by the CGRO guest investigator program and NASA contract 
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Table 1. Sensitivity Simulations 







-Atrial 


Sim.h Calc' 


Sim.h Calc' 


Sim.h Calc' 


Sim.h Calc' 


1. 


1. 


10-* 


0.3677 


0.3679 


1.0065 


1.0000 


1.0014 


1. 


1.1537 


0.9325 


0.1 


1. 


10* 


0.9030 


0.9048 


31.610 


31.623 


0.9908 


1. 


1.1872 


1.1955 


1.0 


0.1 


10* 


0.3611 


0.3679 


0.0298 


0.0316 


0.9977 


1. 


5.7152 


5.6079 


0.1 


0.1 


10* 


0.8985 


0.9048 


1.0016 


1.0000 


1.0269 


1. 


7.1515 


7.1901 


0.01 


0.1 


10* 


0.9888 


0.9900 


31.558 


31.623 


0.9982 


1. 


7.1540 


7.1917 


1. 


0.01 


10" 


0.3618 


0.3679 


0.0010 


0.0010 


0.9923 


1. 


150.43 


153.48 


0.1 


0.01 


10" 


0.9053 


0.9048 


0.0334 


0.0316 


0.8232 


1. 


196.68 


196.78 


0.01 


0.01 


10" 


0.9897 


0.9900 


1.0109 


1.0000 


0.8119 


1. 


195.91 


196.83 


0.001 


0.01 


10* 


0.9987 


0.9990 


31.627 


31.623 


0.9087 


1. 


196.88 


196.83 


1. 


0.001 


10* 


0.3718 


0.3679 


0.0000 


0.0000 


0.6321 


1. 


4901.4 


4829.7 


0.1 


0.001 


.3 X 10-1 


0.9079 


0.9048 


0.0011 


0.0010 


0.7906 


1. 


6186.0 


6192.1 


0.01 


0.001 


3 X 10* 


0.9905 


0.9900 


0.0034 


0.0316 


1.1030 


1. 


6191.3 


6193.7 


0.001 


0.001 


10* 


0.9997 


0.9990 


1.0064 


1.0000 


1.0062 


1. 


6203.2 


6193.7 


10-4 


0.001 


10* 


0.9998 


0.9999 


31.654 


31.623 


1.0161 


1. 


6135.5 


6193.7 



''The ratio /um//*- 
''The ratio /(,//*. 

■^The number of trials in the simulation. 

'^The fraction of the trials in which the host galaxy is detected. 
''The average number of background galaxies detected per error box. 

'^The average value of the odds ratio when only background galaxies are present, and the ratio of priors is 

set to unity. 

SThe average value of the odds ratio when the host galaxy is present in addition to background galaxies. 
The ratio of priors is set to unity. 

''Results of simulation. 

'Calculated using eq. (16). 
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Table 2. Analysis of the Larson & McLean (1997) Observations 



Burst 




^0° 


Cov.<= 


Ki 


/(Ki)d 










?1* 


92^'' 


qtot' 


GRB 910122 


0.19 


19.33 


0.41 


15.3 


4.70 X 


10" 


■27 


3.38 X 


10" 


-27 


1.87 X 


10+25 


0.898 


0.203 


1.102 


ORB 910219 


0.20 


7.30 


1.0 


15.7 


3.25 X 


10" 


-27 


3.05 X 


10" 


-27 


4.49 X 


10+25 


0.656 


0.344 


1.000 


GRB 920325 


0.22 


26.5 


0.87 


14.0 


1.56 X 


10- 


-26 


2.52 X 


10- 


-27 


1.22 X 


10+24 


0.998 


0.025 


1.024 


GRB 920406 


0.12 


1.65 


1.0 


17.1 


8.96 X 


10~ 


-28 


8.47 X 


10" 


-27 


8.26 X 


10+26 


0.100 


0.078 


0.178 


GRB 920501 


0.17 


2.60 


0.91 


14.8 


7.45 X 


10- 


-27 


4.22 X 


10" 


-27 


6.41 X 


10+24 


0.844 


2.429 


3.274 


GRB 920525 


0.15 


3.54 


1.0 


17.0 


9.83 X 


10- 


-28 


5.42 X 


IQ- 


-27 


6.87 X 


10+26 


0.166 


0.063 


0.229 


GRB 920711 


0.23 


2.50 


0.93 


15.5 


3.91 X 


10" 


-27 


2.31 X 


10" 


-27 


2.90 X 


10+25 


0.830 


1.097 


1.927 


GRB 920720 


0.20 


3.78 


0.57 


15.5 


3.91 X 


10- 


-27 


3.05 X 


10- 


-27 


2.90 X 


10+25 


0.842 


0.830 


1.672 


GRB 920723 


0.11 


4.46 


0.88 


16.0 


2.47 X 


10- 


-27 


1.01 X 


10" 


-26 


8.65 X 


10+25 


0.311 


0.201 


0.512 



''Burst redshift from Larson (1997). 
''Size of the error box in arcmin2. 

■^Fraction of the error box covered by the observations. 

"^The flux corresponding to a given K magnitude, using a K=0 flux of 0.62 X 10—20 grg s" ^ cm" 2 Hz" ^. 

"The K-band flux (erg cm-2 Hz"^) corresponding to an L* galaxy at a given redshift based on Gardner et al. 
(1997). 

^Thc diff'crcntial galaxy distribution (galaxies aj:cmin-2 flux-^) based on the cumulative distribution in Figure 1 of Koo 
& Kron (1992). 

SThe probability that the host galaxy is fainter than the detection limit. 
''The value of q2i for the detected galaxy. 

'Total likelihood ratio for the error box, the sum of qi and all the q2i- 
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Table 3. Analysis of the Schaefer et al. (1997) Observations 



Burst 


00=- 










/(Slim)" 


Bi 










q2i^ 


"Jtot' 


GRB 790325 


2.0 


0.145 


1.81 


X 


10" 


-27 


23.0 


2.80 


X 


10" 


-29 


22.53 


4.32 


X 


10" 


-29 


4.53 


X 


1028 


0.015 


0.006 


0.021 


GRB 790406 


0.26 


0.115 


2.88 


X 


10- 


-27 


22.8 


3.37 


X 


10- 


-29 


















0.012 




0.012 


GRB 790613 


0.76 


0.100 


3.81 


X 


10" 


-27 


23.2 


2.33 


X 


10" 


-29 


21.29 


1.35 


X 


10" 


-28 


4.61 


X 


1027 


0.006 


0.072 




























21.61 


1.01 


X 


10- 


-28 


8.32 


X 


1027 




0.040 




























21.57 


1.05 


X 


10" 


-28 


7.73 


X 


1027 




0.044 


0.162 


GRB 920406 


2.0 


0.263 


5.53 


X 


10" 


-28 


23.0 


2.80 


X 


10" 


-29 


















0.049 




0.049 



^Sizc of the error box in arcmin2. 

''Burst redshift converted (assuming h = 0.75) from distance given by Schaefer et al., which in turn is based on the peak flux. 
■^The B-band flux (erg s~^ cm~2 Hz~^) corresponding to an L« galaxy at a given redshift based on RatclifTe et al. (1997). 
''The limiting B-magnitude. 

''The flux corresponding to a given B magnitude, using a B=0 flux of 4.44 x 10~20 erg s~^ cm~^ Hz~^. 

*The differential galaxy distribution (galaxies axcmin"^ flux~^) based on the cumulative distribution in Figure 1 of Koo & Kron (1992). 
^The probability that the host galaxy is fainter than the detection limit. The value is reported on the first line for a given error box. 

''The value of q2i for the detected galaxy. 

'Total likelihood ratio for the error box, the sum of gi and all the q2i- The resulting value is reported on the last line for a given error 
box. 



